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Abstract — This work investigates tlie diffusion of cooperative 
behavior over time in a decentralized cognitive radio network 
with selfish spectrum-sensing users. The users can individually 
choose whether or not to participate in cooperative spectrum 
sensing, in order to maximize their individual payoff defined 
in terms of the sensing false-alarm rate and transmit energy 
expenditure. The system is modeled as a partially connected 
network with a statistical distribution of the degree of the users, 
who play their myopic best responses to the actions of their 
neighbors at each iteration. Based on this model, we investigate 
the existence and characterization of Bayesian Nash Equilibria 
for the diffusion game. The impacts of network topology, channel 
fading statistics, sensing protocol, and multiple antennas on the 
outcome of the diffusion process are analyzed next. Simulation 
results that demonstrate how conducive different network sce- 
narios are to the diffusion of cooperation are presented for 
further insight, and we conclude with a discussion on additional 
refinements and issues worth pursuing. 

Index Terms — Cognitive radio, cooperative spectrum sensing, 
diffusion, network games, Bayesian Nash Equilibrium. 

I. INTRODUCTION 

Dynamic spectrum access (DSA) by interweave cognitive 
radios (ICRs) is emerging as a promising solution to enable 
better utilization of the radio spectrum, especially in bands that 
are currently under-utilized lH], ||2]. DSA partitions wireless 
terminals into categories of primary (licensed) and secondary 
(cognitive radio) users, where the primary users (PUs) have 
priority in accessing the shared spectrum. ICRs are allowed to 
opportunistically use the spectrum only when it is not occupied 
by primary transmitters (PTs) that have priority. Therefore, the 
ICRs do not cause interference to the PUs in principle. In the 
absence of standard control channels or coordinated medium 
access between the primary and secondary users, the ICRs 
must periodically sense the spectrum for the presence of PTs 
and cease transmission upon detection. Local spectrum sensing 
(LSS) algorithms where each ICR makes an independent 
decision on whether the spectrum is available have been 
studied extensively, e.g., |[3]|-||6l. 

LSS may fail to provide sufficient accuracy due to the 
vagaries of the wireless medium such as deep fades and 
shadowing. As a remedy, cooperative spectrum sensing (CSS) 
has been shown to greatly increase the reliability of spectrum 
sensing, with a corresponding increase in complexity and 
energy consumption [71. Under CSS, each ICR either sends 
its local sensing data/decision to a central collector known as 
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the fusion center ll8l- lfT4l . or shares this information with its 
neighbors in the case of distributed ICR networks lfT5l - lfT6l . 
However, the fundamental assumption in the CSS studies cited 
thus far is that all ICRs willingly engage in cooperative sensing 
in order to optimize a global performance metric. In decentral- 
ized networks, ICRs can potentially pursue selfish motives and 
make independent decisions regarding whether to cooperate 
with their peers via CSS or to act alone by adopting LSS. 
Therefore, there has been recent interest in game-theoretic 
models of decentralized networks where ICRs act selfishly 
in order maximize individual utilities lfT7l - ll22l . Yuan et al. 
1 17] study generalized Nash Equilibria of a non-cooperative 
game where the ICR throughput features in the utility func- 
tion. Evolutionary game models and associated evolutionary 
stable strategies with throughput as payoffs are formulated in 
ifTSl . lfT9l . Cooperative game-theoretic approach for coalition 
formation in fixed network topologies are presented in ll20l . 
1211 . while |22i | assumes the presence of a fusion center 
and ICRs non-cooperatively optimize the frequency over time 
with which they participate in CSS. This paper has several 
major differences from llT7l - ll22l as we analyze Bayesian 
Nash Equilibria under an imperfect information scenario, base 
the ICR utility function on the sensing false-alarm rate, and 
explicitly consider a complex, partially-connected network 
topology with an arbitrary degree distribution. 

On a broader level, there has been extensive recent work 
on distributed estimation or detection of a single parameter of 
interest without the need for a central fusion center Diffusion 
strategies for distributed estimation/detection in decentralized 
networks where all users cooperate have been studied in 
Il23l - ll28l . for example. Consensus algorithms for in-network 
computation have been presented in ifTSl . lfT6l . Il29l among 
others. While US), US, ESj-Jlll focus on the diffusion of 
information between collaborating peers, this work is focused 
on the diffusion of cooperation across a decentralized inter- 
weave network composed of selfish users. In other words, 
we are interested in determining conditions under which a 
given ICR network converges towards or diverges away from 
global cooperation over time when users individually adapt to 
the behavior of their peers, with information exchange being 
implicit. 

The motivation to study the diffusion of cooperation is 
as follows. The autonomous nature of the distributed ICR 
network makes it difficult for the cognitive network designer 
to predict the behavior of the ICRs, and whether a steady-state 
outcome exists for the network as a whole, in terms of what 
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fraction of ICRs end up cooperating. This is a critical question, 
since the fundamental basis of deploying spectrum-sensing 
ICRs is to maximize the utilization of unused spectrum - 
which is achieved when the ICR false-alarm rate is minimized 
(subject to a primary detection probability constraint). As 
the per-ICR false-alarm rate is minimized when all ICRs 
participate in CSS, studying the diffusion of cooperation in 
the network and the corresponding steady-state properties 
illustrates how close or far the distributed network is from this 
ideal outcome. The diffusion analysis in this work shows how 
key physical and network parameters such as sensing protocol, 
network degree distribution, number of antennas, shadowing 
correlation and path loss exponent, etc. impact the steady-state 
cooperation outcomes of the distributed ICR network. 

The paper is organized as follows. Section introduces 
the mathematical model of the decentralized ICR network and 
the spectrum sensing performance of the ICRs. The game- 
theoretic model and equilibrium properties of the diffusion 
process are presented in Sec. |III] Sec. |IV] examines the 
impact of various network and sensing parameters on the 
diffusion process. Selected numerical examples are shown in 
Section [V] followed by a discussion of further research issues 
in Section |VI] and we conclude in Section IVIII 

Notation: We will use J\f{m., Z) to denote a multivari- 
ate Gaussian distribution with mean m and covariance ma- 
trix Z, and define the Gaussian Q-function as Q (x) = 
du. We also use £{■} to denote expecta- 
tion, (•)^ for the transpose, (•)^ for the Hermitian transpose, 
(•)^^ for the matrix inverse, x_i to denote a vector excluding 
its i*'' component, [AJ^^- is the element of matrix A, 

and 1 is a column vector of all ones. 



II. Mathematical Model 
A. Network Model 
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As shown in Fig. 1, the system under consideration is 
composed of a PT and n single-antenna ICRs that form 
a partially-connected network represented by the undirected 
graph Q (V, E), with V = 1, . . . ,ti, and E being the set of 
edges or active peer-to-peer links. The i*'^ ICR has a degree 
di drawn from a probability distribution P{d) with support 

d = 0,1,..., A such that J^d^o^i^^) = 1- Each ICR is 
provided prior knowledge of the degree distribution P{d) of 
the overall network by the network designer and knows its own 
degree di, but does not have any knowledge of the degrees of 
its neighbors. The network is decentralized in the sense that no 
central fusion center is present, and only localized peer-to-peer 
interactions are allowed. 

At each time instant of the diffusion process (to be defined 
in Sec. Ullb . the neighbors of each user are a random draw from 
the population specified by P{d), which indicates that a new 
network realization is perceived at every interaction epochQ- 
However, the degree of each ICR remains unchanged over 
time. By definition, degree distribution P{d) characterizes the 
degree of an arbitrary node anywhere in the network. We are 
now interested in the degree distribution P{d) of a neighboring 
node, i.e., given some node i with degree d.^, what is the degree 
distribution of its j*'* neighbor? It is immediately clear that 
this is not also equal to P{d), since P{d) includes degree-0 
nodes in its support that have no neighbors. Instead, consider 
a randomly chosen edge (link) in the network graph, which by 
definition exists between neighboring nodes. Once again, the 
degree of a node we reach by following a randomly chosen 
edge is not given by P{d). Since there are d edges that arrive 
at a node of degree d, we are d times as likely to arrive at that 
node than another node that has degree 1 . Thus, the probability 
that the neighboring node is of degree d is proportional to 
P{d)d; normalizing to yield a valid distribution function we 
obtain 1301 

P{d)d 



P{d) = 



j:dPid)d- 



(1) 



B. Spectrum Sensing Protocol 

We now specify the spectrum sensing algorithm employed 
by the ICRs to detect the presence of the PT. Assume that 
the ICRs are allowed to access the spectrum only if they are 
located outside a guard region of radius G centered on the 
PT. Let yi denote the SNR in dB (logarithmic scale) of the 
PT signal received at ICR i located at a distance from the 
PT. Under log-normal shadowing and distance-dependent path 
loss on the sensing channel, yi is distributed as a Gaussian 
random variable with variance and mean fj,{ri) ifTTl . lfT2l . 

For the CSS scenario with a cluster of c cooperating ICRs, 
the local SNRs are collected into y = [yi, . . . ,yc]^ at a 
randomly chosen participant within the group. We assume each 
ICR truthfully reports its actual local observation during CSS, 
and defer discussion of malicious behavior to Sec. |VT] Since 
the inter-neighbor distances are much smaller in magnitude 
compared to we assume that the SNR reporting is 

conducted without error or delay, and that the cooperating 



Fig. L Decentralized ICR network with cooperative and non-cooperative iThis assumptio n is co nsistent with a mean-field approximation as elabo- 
users of various degrees. rated upon in Sec. ITIFcl 
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ICRs are all roughly the same distance from the PT: ri w 
r2 ~ . . . ~ = r llT2l - |fT3l . However, the small intra-cluster 
size also implies that the SNR observations are correlated, such 
that the normalized covariance matrix S = a^^£ {yy^} is 
non-diagonal and of full rank. 

Therefore, the binary hypothesis test under CSS to deter- 
mine whether the cooperating cluster is within the guard region 
(hypothesis iJo) or not (hypothesis Hi) is 



Ho 
Hi 



y~AA(A* (r + 5)l,a2s), 



(2) 
(3) 



for some (5 > 0. We assume that both CSS and LSS must 
achieve a minimum probability of detection target jS, equiv- 
alent to a missed-detection probability constraint of (1 — 
The imposition of the constraint /? ensures a minimum level 
of protection for the PUs from unintentional ICR interference. 
The Neyman-Pearson-optimal likelihood ratio test lISTI that 
minimizes the probability of false alarm subject to the con- 
straint P is given by the decision rule llT2l 

l^S-iy ffi 
^ T 

with threshold T = fi{G) - Q'^ {P). Define M = 
fi{R + S) — ^ (R), A6 < 0. Under an exponential spatial cor- 
relation model j = /o'*^-' ', the corresponding probability 
of false alarm can be approximated as | li | 



(0 = 1-0 ^^ "71-;^" + 0- mj . 

(4) 

The false alarm probability under LSS is easily obtained from 
dill as 

Pfa,lss = 1-q(^^ + Q-' W)^ . (5) 

At first glance, the primary users appear to be equally 
protected from ICR interference under LSS and CSS, due 
to the detection probability constraint /3 imposed on both 
scenarios. However, from the perspective of the secondary 
network designer it is always desirable to have as many ICRs 
cooperate as possible. Since Pfa,css (c) is monotonically 
decreasing in c, reducing the false-alarm probability via CSS 
greatly improves the spectrum access opportunities of the 
ICRs. The reader is referred to Q, ll32l for specific details on 
protocols for CSS information exchange and medium access 
control after the sensing decisions. 

C. Cost of Cooperation 

The increased complexity of CSS incurs an additional cost 
in terms of energy consumption and delay relative to LSS 
iflTl . Since the diffusion process is iterative by definition, we 
assume the additional delay due to CSS is negligible compared 
to the diffusion time scale. The cost of additional energy 
consumption in reporting SNRs/decisions to a neighbor is 
modeled as follows. Assume that other ICRs in the proximity 
of arbitrary user i are its neighbors if they lie within a circle 
of radius R centered at i, with a total of di such ICRs. R is 
thus assumed to be the maximum communication range of the 



ICRs. Let random variable Y represent the distance between 
neighboring ICRs. The energy consumed by ICR i when it 
participates in CSS (incurred while either reporting i/i or the 
CSS decision) is assumed to be proportional to F": 



E = cY"' 



(6) 



with a as the path loss exponent of the network, and some 
proportionality constant c > 0. 

In sophisticated network geometry models, the user loca- 
tions are often assumed to be governed by a two-dimensional 
stochastic point process ll33l . Assuming n is asymptotically 
large, we can apply a Poisson point process (PPP) model of 
example intensity 

H = tt{R + Df/n 

where D ^ R, for which the cdf of the distance to the nearest 
neighbor 11341 is 



Fy (y) = 1 - e-f'^y . 

The cdf of the energy cost is then obtained as 

Fe{x) ==1-6-'^'^""''°^''°. 



(7) 



(8) 



We can similarly compute the cost distribution for various 
other statistical models of the user spatial locations. More 
generally, in the terminology of Bayesian game theory we 
can define the degree of a neighboring ICR and energy 
cost as its 'type', which is private information known only 
to itself. However, the statistical distribution of each user's 
type is publicly known to all nodes, which is a standard 
assumption in Bayesian games lf35l . Furthermore, note that we 
can replace the CSS and LSS protocols described above by any 
arbitrary choice of decision rule (e.g., energy detection, feature 
detection, hard decision combining) without altering the game- 
theoretic analysis of the diffusion process in the sequel. 

III. Diffusion Network Game 

A. Strategic Non-cooperative Game 

Having delineated the statistical properties of the ICR 
network and spectrum sensing protocols, we now model the 
diffusion process as a non-cooperative game that evolves over 
discrete time steps t ~ 0, 1, . . . ,L. Every ICR has a (pure) 
strategy set of two possible actions A = {0, 1} and plays 
a G A, where action a = corresponds to LSS and a — 1 
corresponds to choosing CSS. Let represent the probability 
at time t that an arbitrary neighbor anywhere in the network 
chooses to cooperate by playing a = 1. Generally, an ICR 
may participate in multiple clusters at a time. In that sense x^d 
is an approximation of the number of cooperative neighbors. 
Due to the communication range limit R and the fact that the 
CSS outcome is broadcast by the local cluster-head, all ICRs 
within a cluster are assumed to be direct neighbors of each 
other The probability that each ICR participates in more than 
one cluster per time step is therefore considered to be small. 
We then define the utility obtained by a degree-di ICR when 
it chooses to cooperate as 



1 



FA,CSS 



(x'd,) 



(9) 
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where Pfa.css ( ) is the cooperative false-alarm rate defined 
in The additional energy cost incurred by ICR i under CSS 
is represented by Ei drawn from distribution Fe{x) [cf. ([8])], 
and costs are i.i.d. across ICRs. Similarly, ICRs abstaining 
from cooperation obtain 



ud, (0,a;*) = 1 - Pfaxss- 



(10) 



The false-alarm probability is meaningful as a utility function 
since it has a direct impact on the ICR throughput. Since 
ICRs must refrain from transmission if they decide that a 
primary user is active, a higher false alarm rate corresponds to 
decreased opportunities for spectrum access. Therefore, max- 
imizing the ICR utility function is equivalent to minimizing 
the false-alarm rate, which is desirable for all ICRs. 

The return function v{di, x*-) represents the additional utility 
gained by a degree-d^ user that chooses to cooperate: 



V {di 



l.x' 



Ud, (0,x*) 



The user payoff functions are finally given by 

Ud^ (1,2:*) - Ci, a = 1 



a, X 



ud, (0,a;*) . 



a = 0. 



(11) 



(12) 



From (l9Tl-(fT2]i. we observe that (i) Ud^ (0,a;*) — Ud^ (a,0), 
(ii) Udi{a,x^) > Udi{a,x*) if i* > x* which implies the 
utilities exhibit positive externalities, and (iii) Ud^ (l,a;*) > 
Udi (0,x*) ydi > OjX* > 0. Therefore, a selfish ICR has an 
incentive to participate in CSS if and only if v {di, a;*) > Ei, 
which occurs with probability 



Pr{z; {d„ x')} > E,} = Fe {v [d,, x')) 



(13) 



At each time step, the ICRs play their myopic best responses 



bi (a_i) = argmaxlld {a,x ) Vi 



(14) 



to maximize their individual payoffs, based on the expected 
behavior of their neighbors parameterized by cc*. The diffusion 
of cooperation through the network is then captured by the 
evolution of over time. The game model is one of incom- 
plete information since each user is assumed to know only its 
own degree and cost; the degree and cost realizations of its 
neighbors are completely unknown. Our game model therefore 
differs significantly from IfTTl . where the utilities are defined in 
terms of throughput and a joint detection probability constraint 
is applied to all ICRs under CSS, leading to a coupled strategy 
space. The non-cooperative diffusion network game in strate- 
gic form is succinctly represented as r'^(V,^, 11^; (a,x*)). 
For the incomplete information scenario, steady-state rest 
points of the diffusion process are described by Bayesian Nash 
Equilibria (BNE) |'35l, which we characterize next. 

B. Structure of Equilibria 

Theorem 1: The set of Bayesian Nash Equilibria of the 
diffusion network game {y,A,\^di io,,x*)) is non-empty. 

Proof: The ICR strategy set A = {0, 1} is a compact 
subset over M, or equivalently, a sublattice of M. The return 
function v{di,x^) is increasing in x* for each feasible value 



of di, which can be verified either by inspection of (fTTt . or 
by verifying that its first derivative 



dv {di,x^) 
dx* 



1 



A6 



(1 - p) x*'di 



2ay/TTp ((1 - p) xhk + 2p)^/^ 



(15) 



is positive (recall that AS < 0). The same is true for the 
individual utility functions and for the payoff 11^; {a,x*). 
Thus, the best response correspondence bi is an increasing 
function on A 1361 . l37l . Define b = Yii the overall best 
response correspondence obtained from the Cartesian product 
of the individual responses; b is also an increasing function. 

Now, Tarski's fixed point theorem ll35l . ll38l states that "Let 
(5; >) with binary relation ' >' be a non-empty compact 
sublattice of M" and / : 5 — > 5 an increasing function on 
S, such that for x,y E S, y > x implies f{y) > f{x). Then 
the set of fixed points of / is non-empty." Therefore, based 
on the preceding discussion, the best response correspondence 
^ '■ Tli ^ Yii a non-empty set of fixed points. Since 

the set of fixed points of a best response correspondence is 
the set of pure BNE l35l . Theorem 1 follows. ■ 

The result in Theorem [T| is refined further as follows. 

Theorem 2: The strategic diffusion network game 
{V, A, Hdi {a, .T*)) has a unique BNE in pure strategies. 
Proof: For (V,A,Ild, (a,.T*)) to have a unique BNE, 
it suffices to show that the probability of cooperation 
Fe (v {di,x'^)) is concave in x* for each di ll36l . Defining 
7] = (1 — p) x*di + 2p, we first verify that v {di,x*) is concave 
by computing its second derivative 



d^v(^di,x*) 



dx^ 



A3((l-p)d.)^ ^-0.5(A^,/=S3+c 
2(Ty27r(l+p) 



AS 



(1+P) 



(/?) 



AS 



which is non-positive for each di. For the PPP spatial model, 
the cost distribution in ([8]l is concave in x since the second 
derivative 



d^FE {x) _ 2 
dx'^ a 



Ca;(2-2a)/ag-£a;=/° 



2cx' 



is negative, where we have defined c = pTTc~'^/°'. 
FE{v{di,x^)) for the PPP model is the composition of 
two concave functions, therefore it is also concave in x*. ■ 
A fixed-point characterization of the unique BNE can be 
obtained as follows. Since P (d) is the probability of having 
a neighbor of degree d and Fe {v {di,x*)) is the probability 
that a neighbor cooperates, due to the law of total probability 
we have 

x* = 0(x*) ^^^P(rf)FB(t;(d,x*)). (16) 

From Theorem m the BNE of the diffusion game must satisfy 
(O, and the point satisfying ( fT6] l must be the BNE l36l . 

The final piece of the puzzle relates to the achievability of 
the unique BNE specified above, which is resolved next. We 
will make use of the notion of a supermodular game with 
positive externalities, where user payoffs are increasing in the 
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actions of their neighbors ||39l . ||40l . In other words, in a 
supermodular game the actions of users mutually reinforce 
the decisions of their neighbors to follow the same action. 
A classic example is the power control game in distributed 
networks; an increase in the perceived interference power at a 
node triggers it to also increase its own transmit power, and 
so on 13911 . 

Theorem 3: The myopic best response dynamics of the 
selfish ICRs in ( fT4b is guaranteed to converge to the unique 
BNE of T'^{V,A, Hd, (a,x*)). 

Proof: Note that Hd^ {a,x*) is upper semi-continuous in 
a as the player transitions from a = to a = 1. Furthermore, 
the payoff function satisfies the property of having increasing 
differences in (a,a_i), i.e., 

Hd, ({aU'_J,x*) -Hd, ({a„a'_J,.T*) > Hrf. a-,}, a:*) 

- lid, ({ai,a_i},a;*) 

for all > ai and a'_j > a_i. We have already mentioned 
that Ais sublattice of R. Therefore, T'^(y,A,Ildi (a, a;*)) 
satisfies the properties of a supermodular game with positive 
externalities |f39l . BOl . and it is known that myopic best 
responses always converge to equilibrium in such games ll35l . 

m- ■ 

The existence and achievability of a unique BNE as shown 
above is especially useful since n-player supermodular games 
with two actions per player can have up to [^] pure-strategy 
BNE in general 1421 . The algorithmic description of the 
Bayesian CSS diffusion game introduced in this section is 
summarized below. 



Algorithm Bayesian CSS diffusion game 
Require: < e < lO"^^ ^ = q 
Initialization: 

Initialize x'^ with a random number between and 1; t — 

t + 1 

while — cf) > e do 

ICRs play best-response strategies 

bi (a_i) = argmaxllrf. (a,x*) Vz 

Update (j> (a;*+^) via fixed-point eq. (16); t = t + 1 
end while 



C. Time Dynamics of Diffusion Process 

Having established the properties of steady-state equilibria, 
it is also of interest to examine the evolution of the diffusion 
process over time. Since each ICR can be in one of two 
states (cooperative or non-cooperative) at each time instant, 
the diffusion process dynamics is a discrete-time Markov chain 
with n? possible states. A mean-field approximation alleviates 
the complexity of such stochastic dynamics by replacing 
the Markov-chain model with a deterministic discrete-time 
process. This approximation relies on the stochastic process 
remaining in the same subset space with a probability arbitrar- 
ily close to one, provided that the population is large enough 
l30l . The analytical complexity is further reduced by assuming 
that at each time instant the neighbors of each ICR are drawn 



randomly from the population. Therefore, the network degree 
distribution P{d) can be used to characterize the diffusion 
process instead of having to account for all possible network 
connections and topologies, which is nearly intractable ll30l . 

Eg), iS. 

An important metric that captures the evolution of the 
cooperative behavior of the network is the relative density 
(fraction) of cooperating ICRs ^* at time t. By definition, we 
have 

where is the relative density of degree-/c cooperative ICRs. 
Assuming the increments in time are arbitrarily small, the 
dynamic mean-field equation can be written as 

^ = -e, (1 - [v (fc,x*))) + euFE {v {k,x')) . 

Setting the derivative to zero yields the stationary condition 
£^1 = Fe [v (fc,a;*)). Thus at a steady-state BNE, we have 

^NE = Y.kP^^)PE{v{k,x')), (18) 

which verifies that ^* is increasing in x* due to Fe {v {k, x*)) 
having the same property. 

IV. Diffusion Outcomes 

A. Impact of Network Parameters 

The network parameters that potentially impact the outcome 
of the diffusion process are 

• Network degree distribution P{d) or P{d). 

• PT detection constraint jS. 

• Energy cost distribution Fe{x). 

• Sensing protocol. 

• Shadowing correlation and path loss exponent. 
Changes in any of the above parameters will either lead to a 
lower or higher equilibrium value of x*. Furthermore, some 
of these parameters are coupled and cannot be manipulated 
independently, for e.g., varying the path loss exponent will 
also alter the energy cost distribution. The network mapping 
(j> (a;*) defined in ( fT6b offers a means of comparing the extent 
to which cooperation spreads in different networks. Given two 
networks that differ in one parameter with all others being the 
same, we now present a framework to evaluate which of the 
two is more conducive to the diffusion of cooperation. We will 
require the following definitions. 

Definition 1: Given two random variables A and B with 
distribution functions FA{y) and Fsiy), if FA{y) > Fsiy) 
Vy E M, then A is first-order stochastically dominated by B, 
denoted as B ^ A. 

Definition 2: A network with map (j>' (x*) is said to be more 
conducive to diffusion compared to map 4>{x*) if 4>{x*) < 
(j> (x*) for each .t*, since that is equivalent to (j> (a;*) having a 
higher BNE point x*. If the converse holds then the network 
(j)' (x*) is less conducive to diffusion. 

We can then summarize the impact of varying a specific 
network parameter while fixing the remainder as follows. 

. Varying If P{d) >z P'{d), then </.(x*) = 

T.dPid:)FE{v{d,x')) > T.dP'{d)FE{v{d,y)) = 
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(j)' (x*). Increasing the probability of higher-degree neigh- 
bors aids diffusion. 

• Varying f3: The false-alarm probabihty of any feasible 
decision rule is non-decreasing in (3 (property of any 
receiver operating characteristic). For /3' < /3, we have 

PpA.cssic) < PFA,cssic) and Pfa,lss < Pfaxss- 
This implies v'{di,x^) > v{di,x*) and Fe {v' {di,x*)) > 
Fe {v ((ii,.T*)), therefore (f>' (x*) > <f>{x*). Lowering the 
detection probability constraint increases the gain from 
CSS while the cost remains unchanged, thus enhancing 
diffusion. 

• Varying Fe{x): It is obvious that for Fe{x) >: F'^{x), 
we have (p' (a;*) < (f> (x*). Increasing the cost of cooper- 
ation can only hinder the diffusion process. 

• Varying tlie sensing protocol; Since the sensing pro- 
tocols in Sec. III-BI are Neyman-Pearson optimal, for any 
competing sensing protocol with metrics Pp^ cssi'^) ^^'^ 
P'fa,lss^ we must have PpAfissi^^) ^ Pfa,css{c) and 
P'fa lss — Pfa,lss for the same detection constraint 
(3. This implies v'{di,x*) < v{di,x^) and ultimately 
4>' (2;*) < 4>{x*')- Deviating from the optimal likelihood 
ratio test decreases the likelihood of cooperation. 

• Varying p: For shadowing correlation coefficients p and 
p\ if p' > p, since the term ^{(1 — p) c + 2/9)/l + p 
in (IHi is decreasing in p we obtain 4>' {x^) < 4>{x*)- 
Increased spatial correlation diminishes the gain from 
CSS and makes cooperation less likely. 

• Varying a: For path loss exponents a, a', if a > a' then 
Fe{x) >: F'^{x) and <}>' (x*) > (a;*). Increasing the cost 
distribution clearly inhibits diffusion. 

B. Impact of Multiple Antennas 

While the development thus far has considered the case 
of single-antenna users, it is worthwhile to investigate the 
multiple-input multiple-output (MIMO) scenario where the 
ICRs are equipped with M antennas each. The most notable 
impact is on the performance of the LSS and CSS likelihood 
ratio tests. Specifically, we must now account for the correla- 
tion between antennas at each ICR, in addition to the spatial 
correlation across the ICRs. 

Let the {M x 1) received vector at the i*'* ICR be 

Zj = Si + ni 

where is the observation of the primary signal and ~ 
CA/^(0,cr^I) is complex additive Gaussian noise independent 
of Si. Due to the effect of antenna correlation, ^ = 
E is a non-diagonal Hermitian matrix. Since each 

ICR reports its local SNR under CSS, the SNR-maximizing 
strategy in the multi-antenna scenario is to coherently combine 
the SNRs measured at each of the AI antennas via maximum- 
ratio combining (MRC). 

Given the eigenvalue decomposition T>s.i = UAU^, the 
optimal combining rule is to first decorrelate the received 
signal as = XJ^Zj, followed by MRC to yield the 
output SNR 7i = zfzi/2al g?]. Since U is unitary, the 
distribution of 7^ coincides with that of the combiner output 
SNR 7; = zfzi/2af^ in the uncorrected antenna scenario. 



The MRC output SNR 7; is the sum of the M i.i.d. per- 
antenna SNRs, each of which is a Gaussian random variable 
(in dB) with mean p{r) and variance cr^, which implies 
7i ^ N [M p{r) ^ M a^) . Assuming error-free SNR reports as 
before, the binary hypothesis test under MIMO CSS is 

The corresponding probability of false alarm can then be 
written as 

(19) 

and the false alarm probability under MIMO LSS is 

P^'a,lss = l-Q + . (20) 

Assume that the SNR/decision reports are conducted using 
one out of the AI antennas, such that the cost distribution is 
unchanged. It then follows that v^\di,x*) > v{di,x*) and 
(a;*) > (/)(a;*), where the superscript 'M' denotes metrics 
of the MIMO scenario. In other words, deploying multiple 
antennas at the ICRs enhances the diffusion of cooperation in 
the network. Furthermore, the trends and conclusions derived 
in Sec. IIV-AI can be shown to also hold for the MIMO ICR 
network. 

V. Simulation Results 

In this section, we present the results of several numerical 
experiments that investigate how conducive different decen- 
tralized ICR networks are to the diffusion of cooperation. 
The background AWGN variance at all receivers is assumed 
to be unity. In addition, unless specified otherwise we set 
the number of ICRs to n = 18, number of antennas 
per ICR as A/ = 1, the network degree distribution as 
[P(l) = 0.37 P (2) = 0.33 P(3)==0.3], D = 20m, 
the detection probability target to /? = 0.95, path-loss exponent 
a = 2.5, proportionahty constant c — 2, shadowing correlation 
p — e-o.ii?/(n-i) |[YY|| sensing parameters S = — 0.09,(t — 
3.3 ifTTI . range R = 2m, and initial cooperation probability 
x° = 0.3. 

Fig. I2] presents the evolution of diffusion metrics x* and ^* 
over time for n = 18, R. = 2m, and n = 40, R = 1.25m, 
assuming AI = 1 and an initial value of a;° = 0.2. We 
observe that for the case n = 18, a;* and ^* rapidly attain their 
equilibrium values within four and six time steps, respectively, 
and do not exhibit major changes from the initial values. On 
the other hand, the scenario of n = 40 exhibits much more 
dramatic increases overtime since CSS is more attractive given 
the lower energy cost due to R being smaller The cooperation 
metrics increase monotonically over time, despite ICRs having 
the freedom to switch back to LSS from CSS in previous time 
instants. This implies that cooperation is a mutually reinforcing 
behavior for ICRs for whom the gain from CSS outweighs the 
cost, in spite of their myopic view of the network. 
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Fig. 2. Evolution of probability of cooperation and relative density of 
cooperating ICRs over time t. 
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Fig. 3. Equilibrium probability of cooperation and relative density of 
cooperating ICRs versus R. 



In Fig. [5] equilibrium metrics x* and ^* are displayed as 
the maximum inter-ICR communication range R increases. 
As discussed in Sec. IIVI an increase R raises the cost of 
cooperation while the benefit from CSS remains unchanged, 
and this is evident from the sharp decrease from 80% to 
18% in relative density for M ~ 1. Interestingly, when 
ICRs are equipped with an additional antenna, the increase 
in R is not enough to offset the gain from CSS, and in fact 
global cooperation is the equilibrium result for the scenario of 
M = 2. 

In Fig. m equilibrium metrics x* and ^* are displayed 
for M = 1,2, as a function of the primary user detection 
probability constraint /3. Interestingly, for less stringent values 
of /3, MIMO ICRs tend to abstain from cooperation since their 
local false-alarm rate is correspondingly low and the benefit of 
cooperation does not outweigh the cost. As /3 becomes more 
stringent, there is clearly a change in MIMO ICR behavior for 
/3 > 0.8 where they become much more conducive to CSS. 

In Fig.|5] equihbrium metrics x* and ^* are displayed as the 
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Fig. 5. Equilibrium probability of cooperation and relative density of 
cooperating ICRs versus M. 



number of ICR antennas is increased. The shadowing correla- 
tion assumption strongly inhibits the increase of diffusion as 
M goes from one to three, in contrast to the ideal uncorrected 
case of p = 0. Since Af < 3 for practical mobile terminals, 
only minor gains in cooperative behavior are apparent from 
deploying multiple antennas in a realistic fading environment. 

VI. Discussion 

In this section we touch upon several directions for further 
research, ranging from network topology optimization to more 
sophisticated ICR communication models and best-response 
strategies. 

A. Network Optimization 

From the perspective of the ICR network designer, it may be 
desirable to achieve a target equilibrium relative density ^* that 
corresponds to a satisfactory ICR spectrum usage efficiency (in 
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terms of network-wide false alarm rate). Of the parameters 
listed in Sec. IIV-AI the majority are either determined by 
nature {p,a) or by spectrum regulations ((3, sensing protocol), 
which leaves the network degree distribution P{d) as a metric 
that could be manipulated after ICR deployment. There are 
potentially an infinite number of feasible degree distributions 
that yield a target equilibrium density Bettstetter ||34l 
states that for a 2D-PPP spatial model with intensity a, the 
probability that an arbitrary node has degree k for given range 
R is 



Pr(d. = fc) = 



from which we can determine the required range and the 
corresponding ICR transmit power (assumed to be proportional 
to R^") such that this probability is arbitrarily close to 1. 
Therefore, a naive approach for creating a desired degree 
distribution P(d) is to assign corresponding transmit powers 
in the same proportion. Needless to say, such an approach ne- 
glects intra-ICR interference, and a general solution remains an 
open problem (see B31 . 1461 for more sophisticated analyses 
of topology control in ad hoc networks). 

B. Imperfect Inter-ICR Reports 

While shadowing and noise impairments are considered 
on the sensing channels, the assumption of noiseless inter- 
neighbor reporting channels in this paper provides an upper 
bound on the diffusion level of a decentralized ICR network. 
This is because imperfect SNR reports will degrade the 
performance of CSS, which in turn diminishes the return 
function in (fTTT l and thus the probability of cooperation in 
( fT3] l. Nonetheless, low-rate SNR reports with sufficient error 
control coding can approach the performance promised by the 
ideal reporting channel assumption. 

A related question worth answering is the following: given 
the selfish assumption for the individual ICRs, do they have 
an incentive to behave maliciously by intentionally reporting 
false SNR values to their neighbors? The impact and detection 
of falsified reports in cooperative spectrum sensing have been 
studied extensively in recent literature BtI . |@8l- In our system 
model, if a malicious ICR overhears its neighbor's reports 
and assumes they are truthful, it can combine them with 
its own observation to privately compute Pfa.css while 
publicly reporting a false value. If these false reports increase 
the publicly-known probability of false alarm, then the gain 
from CSS is diminished and naive ICRs are less likely to 
choose cooperation. In the next iteration, the maUcious ICR 
is then less likely to receive SNR reports from its neighbors, 
which discourages future false SNR reports and shows that 
the diffusion game intrinsically rewards truthful behavior. The 
security aspects of diffusion-based CSS therefore invite further 
study, especially since the reputation-based and statistical false 
report detection techniques in pTl . ||481 may not be applicable 
under the mean-field assumption. 

C. Beyond Myopic Strategies 

We have seen that the myopic best response strategy in (fl4l 
converges to the BNE of the diffusion game. On the other 



hand, the adage "the more you know the better you can do" 
certainly holds true for the diffusion game as well. Specif- 
ically, if ICRs can obtain more information regarding their 
local neighborhood, they can conceivably make better choices 
regarding whether to take part in CSS. We briefly mention 
two such possibilities. Firstly, side information regarding the 
degrees of an ICR's neighbors can be exploited by choos- 
ing to perform CSS only with high-degree neighbors, since 
higher-degree nodes enhance diffusion as shown in Sec IIV-AI 
The best response strategy would then become dependent on 
the degrees of the neighbors POl . Secondly, additional side 
information regarding the cost realized at its neighbors can 
also be useful to an ICR. Since the cost is assumed to be 
realized once at the initialization of the diffusion process, an 
ICR can exploit this information to predict future values of 
cc* and construct a best-response strategy with memory, as 
opposed to the memoryless myopic best response approach in 
the current model. 

VII. Conclusion 

This work investigates the diffusion of cooperative behavior 
over time in a decentralized cognitive radio network with 
selfish spectrum-sensing users. The system is modeled as a 
partially connected network with a statistical distribution of 
the degree of the users, who play their myopic best responses 
to the actions of their neighbors at each iteration. We proved 
the existence of an unique Bayesian Nash Equilibrium for 
the diffusion game, provide its fixed-point characterization, 
and show it is achievable with myopic best responses. The 
impacts of network topology, channel fading statistics, and 
sensing protocol on the outcome of the diffusion process have 
been examined. Simulation results demonstrate how conducive 
different network scenarios are to the diffusion of cooperation, 
and we conclude with a discussion on additional refinements to 
reporting channel models, security aspects, and best response 
strategies. 
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